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DETAILED ACTION 

1. This action is responsive to application filed on Dec. 19 2001 . Claims 1 - 66 are 
pending examination. 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

2. Claims 1 - 3, 5, 6, 8, 10, 14 - 18 - 26, 28, 29, 31 - 37, 39, 42 - 46, 49, 50, 53 - 
56, 58 - 63 and 66 are rejected under 35 U.S.C. 102(e) as being anticipated by Britton 
et al U.S. Patent No.6,535,896 (referred to hereafter as Britton). 

Britton teaches the invention explicitly as claimed. Britton teaches systems , methods 
and computer program products for utilizing XML - based tools to tailor HTML - based 
web page content for display within various client devices (see abstract). 

As to claim 1 , Britton teaches a method of transcoding information in a first 
markup language into a second markup language, the method comprising the steps of: 
responding to a request to view a Web page by retrieving information from said 
Web page, wherein said information is in a first markup language (see col. 3 lines 28 - 
39, Britton discloses requesting an HTML- based web page ) ; 



Application/Control Number: 10/029,700 Page 3 

Art Unit: 2157 

normalizing said information(see col. 3 lines 28 - 43 Britton discloses 
transferring the HTML - based web page format to XML format ); 

determining a second markup language that can be used by a browser using 
device detection, wherein said browser is used by a computer that is to view said 
information ( see col. 3 lines 40 - 48,Britton discloses the modified portions of XML 
document are converted back to HTML format) ; and 

transcoding said information into said second markup language( see col. 3 lines 
40 - 48, Britton discloses the conversion of the document to HTML and sent the web 
page with modified content to a client device for display). 

As to claim 2, Britton teaches the method of claim 1 , further comprising the step 
of sending said information in said second markup language to said computer (see col. 
3 lines 40 - 48, Britton discloses the conversion of the document to HTML and sent the 
web page with modified content to a client device for display). 

As to claim 3, Britton teaches the method of claim 2, wherein said computer is a 
wireless mobile device (see col. 3 lines 28 - 38. Britton discloses sending of a web page 
to pervasive computing devices). 

As to claim 5, Britton teaches the method of claim 2 wherein the step of sending 
said information in said second markup language to said computer comprises sending 
said information to said computer using automatic page division (see col. 3 lines 40 - 
48, Britton discloses the conversion of the document to HTML and sent the web page 
with modified content to a client device for display). 
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As to claim 6, Britton teaches the method of claim 1 , wherein said step of 
transcoding comprises the steps of: 

selecting a renderer that is associated with said second markup language from a 
plurality of Tenderers associated with markup languages ( see col. 3 lines 43 - 48, 
Britton discloses converting back to HTML format); 

sending said information through said renderer ( see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML and sent the web page with modified 
content to a client device for display); and 

transcoding said information into said second markup language using said 
renderer( see col. 3 lines 40-48, Britton discloses the conversion of the document to 
HTML format). 

As to claim 8, Britton teaches the method of claim 1 , wherein said step of 
normalizing comprises the step of transcoding said information in said first markup 
language into an intermediate markup language (see col. 5 lines 1-11 Britton 
discloses that XSL can be used to transform an XML document into one HTML view). 

As to claim 1 0, Britton teaches the method of claim 1 , wherein said second 
markup language comprises the Extensible Markup Language (XML) (see col.6 lines 10 
- 14 Britton discloses HTML web page content format converted to XML format). 

As to claim 14,Britton teaches the method of claim 1, wherein said second 
markup language comprises the HyperText Markup Language (HTML) (see col. 3 lines 
40 49, Britton teaches HTML language). 
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As to claim 1 5, Britton teaches the method of claim 1 , wherein said steps of 
responding, normalizing, determining, and transcoding occur automatically ( see col. 3 
lines 40 - 48, Britton discloses the conversion of the document to HTML and sent the 
web page with modified content to a client device for display). 

As to claim 16, Britton teaches the method of claim 1 , wherein said first markup 
language comprises the HyperText Markup Language (HTML) (see col. 3 lines 40 49, 
Britton teaches HTML language). 

As to claim 1 7, Britton teaches the method of claim 1 , further comprising the step 
of sending said information in said second markup language to said computer over a 
system of networked computers (see fig. 2 Britton discloses a pervasive computing 
device in a network). 

As to claim 1 8, Britton teaches the method of claim 1 , wherein a first object 
embodies said information in said first markup language and said step of transcoding 
further comprises automatic object conversion of said first object to a second object 
embodying said information in said second markup language (see col. 3 lines 40-48, 
Britton discloses the conversion of the document to HTML format). 

As to claim 19, Britton teaches the method of claim 1 , further comprising 
providing an error logging system (see col. 4 lines 54 - 60 Britton discloses reporting 
errors). 

As to claim 20, Britton teaches the method of claim 1, wherein said second 
markup language is a markup language other than the HyperText Markup Language 
(HTML) (see col. 5 lines 1-13 Britton discloses Extensible Stylesheet Language). 
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As to claim 21 , Britton teaches the method of claim 1 , wherein said device 
detection comprises referring to an HTTP user agent header field (see col. 6 line 66 - 
col. 35, Britton discloses the hypertext transfer protocol). 

As to claim 22, Britton teaches the method of claim 1 , wherein said device 
detection comprises detecting said browser and said computer using unique signature 
detection (See col. 3 lines 7-14, Britton discloses a modified web browser). 

As to claim 23, Britton teaches the method of claim 1 , further comprising dividing 
said information in said second language into at least two pages using automatic page 
division (see col. 3 lines 50 - 62 Britton discloses web pages having a mixture of 
formats to be converted to a single format). 

As to claim 24, Britton teaches a method of transcoding information in a first 
markup language into a second markup language, the method comprising the steps of: 

responding to a request to view a Web page via a computer (see col. 3 lines 28 - 
39, Britton discloses requesting an HTML- based web page ); 

retrieving information from said Web page, wherein said information is in a first 
markup language, normalizing said information (see col. 3 lines 28 - 43 Britton 
discloses transferring the HTML - based web page format to XML format ); and 

transcoding said information into a second markup language, wherein said 
computer is adapted for utilizing said second markup language( see col. 3 lines 40 - 48, 
Britton discloses the conversion of the document to HTML and sent the web page with 
modified content to a client device for display). 
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As to claim 25, Britton teaches the method of claim 24, wherein said step of 
normalizing comprises the step of transcoding said information in said first markup 
language into an intermediate markup language (see col. 5 lines 1-11 Britton 
discloses that XSL can be used to transform an XML document into one HTML view). 

As to claim 26, Britton teaches the method of claim 24, wherein said computer is 
a wireless mobile device (see col. 3 lines 28 - 38. Britton discloses sending of a web 
page to pervasive computing devices). 

As to claim 28, Britton teaches the method of claim 24, further comprising 
dividing said information in said second language into pages using automatic page 
division (see col. 3 lines 50 - 62 Britton discloses web pages having a mixture of 
formats to be converted to a single format). 

As to claim 29, Britton teaches the method of claim 24, wherein said step of 
transcoding comprises the steps of: 

determining said second markup language, wherein said computer is adapted 
for utilizing said second markup language( see col. 3 lines 40 - 48,Britton discloses the 
modified portions of XML document are converted back to HTML format); 

selecting a rendererthat is associated with said second markup language from a 
plurality of Tenderers associated with markup languages( see col. 3 lines 40 - 48, 
Britton discloses the conversion of the document to HTML format); 

sending said information through said renderer ( see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML and sent the web page with modified 
content to a client device for display) ; and 
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transcoding said information into said second markup language using said 
renderer ( see col. 3 lines 40 - 48, Britton discloses the conversion of the document to 
HTML format). 

As to claim 31 , Britton teaches the method of claim 24, wherein said steps of 
responding, retrieving, normalizing, and transcoding occur automatically (see col. 3 
lines 40 - 48, Britton discloses the conversion of the document to HTML and sent the 
web page with modified content to a client device for display). 

As to claim 32, Britton teaches the method of claim 24, wherein a first object 
embodies said information in said first markup language and said step of transcoding 
further comprises automatic object conversion of said first object to a second object 
embodying said information in said second markup language (see col. 3 lines 40-48, 
Britton discloses the conversion of the document to HTML format). 

As to claim 33, Britton teaches the method of claim 24, further comprising 
providing an error log that reports errors that occur during at least one of said steps of 
responding, retrieving, normalizing, and transcoding (see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML and sent the web page with modified 
content to a client device for display). 

As to claim 34, Britton teaches the method of claim 24, wherein said second 
markup language is a markup language other than the HyperText Markup Language 
(HTML) (see col. 5 lines 1-13 Britton discloses Extensible Stylesheet Language). 

As to claim 35, Britton teaches the method of claim 24, further comprising the 
steps of: detecting a browser of said computer; and determining said second markup 
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language that is used by said browser based on said step of detecting (see col. 6 line 
66 - col. 35, Britton discloses the hypertext transfer protocol). 

As to claim 36, Britton teaches a method of transcoding information in a first 
markup language into a second markup language, the method comprising the steps of: 

responding to a request to view a Web page language (see col. 3 lines 28 - 39, 
Britton discloses requesting an HTML- based web page ); 

retrieving information from said Web page, wherein said information is in a first 
markup language(see col. 3 lines 28 - 43 Britton discloses transferring the HTML - 
based web page format to XML format ); 

device detection to determine said second markup language that is used by said 
browser(See col. 3 lines 7-14, Britton discloses a modified web browser) ; and 

transcoding said information into a second markup language, wherein said 
computer is adapted for utilizing said second markup language( see col. 3 lines 40 - 48, 
Britton discloses the conversion of the document to HTML and sent the web page with 
modified content to a client device for display). 

As to claim 37, Britton teaches the method of claim 36, wherein said computer is 
a wireless mobile device (see col. 3 lines 28 - 38. Britton discloses sending of a web 
page to pervasive computing devices). 

As to claim 39, Britton teaches the method of claim 36, wherein said step of 
transcoding comprises the steps of: 
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selecting a renderer that is associated with said second markup language from a 
plurality of Tenderers associated with markup languages( see col. 3 lines 40 - 48, 
Britton discloses the conversion of the document to HTML format); 

sending said information through said renderer ( see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML and sent the web page with modified 
content to a client device for display) ; and 

transcoding said information into said second markup language using said 
renderer ( see col. 3 lines 40 - 48, Britton discloses the conversion of the document to 
HTML format). 

As to claim 41 , Britton teaches the method of claim 36, wherein said steps of 
responding, retrieving, device detection and transcoding occur automatically (see col. 3 
lines 40 - 48, Britton discloses the conversion of the document to HTML and sent the 
web page with modified content to a client device for display). 

As to claim 42, Britton teaches the method of claim 36, further comprising 
dividing said information in said second language into pages using automatic page 
division (see col. 3 lines 40 - 48, Britton discloses the conversion of the document to 
HTML and sent the web page with modified content to a client device for display). 

As to claim 43, Britton teaches the method of claim 36, wherein a first object 
embodies said information in said first markup language and said step of transcoding 
further comprises automatic object conversion of said first object to a second object 
embodying said information in said second markup language (see col. 3 lines 40 - 48, 
Britton discloses the conversion of the document to HTML format). 
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As to claim 44, Britton teaches the method of claim 36, further comprising 
transcoding said information in said first markup language into an intermediate markup 
language prior to transcoding said information into second markup language (see col. 5 
lines 1-11 Britton discloses that XSL can be used to transform an XML document into 
one HTML view). 

As to claim 45, Britton teaches a system for viewing a Web page by a computer 
that utilizes a markup language, the system comprising: 

a computer, wherein said computer requests to view a Web page(see col. 3 lines 
28 - 39, Britton discloses requesting an HTML- based web page ); 

information from said Web page, wherein said information is in a first markup 
language; a device detector, wherein said device detector determines a second markup 
language that said computer utilizes( see col. 3 lines 40 - 48,Britton discloses the 
modified portions of XML document are converted back to HTML format); and 

a renderer, wherein said renderer transcodes said information into said second 
markup language, wherein said information is sent to said computer( see col. 3 lines 40 
- 48, Britton discloses the conversion of the document to HTML and sent the web page 
with modified content to a client device for display). 

As to claim 46, Britton teaches the system of claim 45, further comprising: 

a normalizer, wherein said normalizer transcodes said information in said first 
markup language into an intermediate markup language(see col. 3 lines 28-43 Britton 
discloses transferring the HTML - based web page format to XML format). 
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As to claim 49, Britton teaches the system of claim 45, wherein said computer 
utilizes a markup language other than the HyperText Markup Language (HTML) (see 
col. 5 lines 1-13 Britton discloses Extensible Stylesheet Language). 

As to claim 50, Britton teaches the system of claim 45, wherein said computer is 
a wireless mobile device (see col. 3 lines 28 - 38. Britton discloses sending of a web 
page to pervasive computing devices). 

As to claim 53, Britton teaches the system of claim 45, wherein said information 
in said second markup language is sent to said computer over a system of networked 
computers (see fig. 2 Britton discloses a pervasive computing device in a network). 

As to claim 54, Britton teaches the system of claim 45, wherein a first object 
embodies said information in said first markup language and said renderer uses 
automatic object conversion to convert said first object to a second object embodying 
said information in said second markup language (see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML format). 

As to claim 55, the system of claim 45, further comprising an error logging 
system (see col. 4 lines 54 - 60 Britton discloses reporting errors). 

As to claim 56, Britton teaches the system of claim 45, wherein said second 
markup language is a markup language other than the HyperText Markup Language 
(HTML) (see col. 5 lines 1-13 Britton discloses Extensible Stylesheet Language). 
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As to claim 57, Britton teaches the system of claim 45, wherein said device 
detector uses unique signature detection (See col. 3 lines 7-14, Britton discloses a 
modified web browser). 

As to claim 58, Britton teaches a system for viewing a Web page by a computer 
that utilizes a markup language other than the Hypertext Markup Language (HTML), 
the system comprising: 

a computer, wherein said computer requests to view a Web page (see col. 3 
lines 28 - 39, Britton discloses requesting an HTML- based web page ) ; 

information from said Web page, wherein said information is in a first markup 
language(see col. 3 lines 28 - 39, Britton discloses requesting an HTML- based web 
page ); 

a normalizer, wherein said normalizer normalizes said information in said first 
markup language into an intermediate markup language(see col. 3 lines 28 - 43 Britton 
discloses transferring the HTML - based web page format to XML format ); and 

a renderer, wherein said Tenderer transcodes said information in said 
intermediate markup language into a second markup language, wherein said second 
markup language is a markup language that said computer utilizes and said second 
markup language is a markup language other than HTML (see col. 5 lines 1-11 Britton 
discloses that XSL can be used to transform an XML document into one HTML view). 

As to claim 59, Britton teaches the system of claim 58, further comprising: 
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a device detector, wherein said device detector determines said second markup 
language based on a browser of said computer (see col. 6 line 66 - col. 35, Britton 
discloses the hypertext transfer protocol). 

As to claim 60, Britton teaches the system of claim 58, wherein said computer is 
a wireless mobile device (see col. 3 lines 28 - 38. Britton discloses sending of a web 
page to pervasive computing devices). 

As to claim 61 , Britton teaches the system of claim 58, wherein a first object 
embodies said information in said first markup language and said renderer uses 
automatic object conversion to convert said first object to a second object embodying 
said information in said second markup language (see col. 3 lines 40 - 48, Britton 
discloses the conversion of the document to HTML format). 

As to claim 62, Britton teaches computer executable process steps operative to 
control a computer, stored on a computer readable medium, comprising: 

a plurality of steps to receive data required for subsequent calculations(see col. 3 
lines 28 - 43 Britton discloses transferring the HTML - based web page format to XML 
format ); and 

a plurality of steps to automatically transcode information in a first markup 
language into a second markup language, wherein said second markup language is 
automatically determined ( see col. 3 lines 40-48, Britton discloses the conversion of 
the document to HTML and sent the web page with modified content to a client device 
for display). 



Application/Control Number: 10/029,700 Page 15 

Art Unit: 2157 

As to claim 63, Britton teaches the steps of claim 62, further comprising a step to 
automatically normalize said information in said first markup language prior to 
transcoding said information into said second markup language (see col. 3 lines 28 - 43 
Britton discloses transferring the HTML - based web page format to XML format). 

conversion of the document to HTML format). 

As to claim 66, Britton teaches the method of claim 64, further comprising 
dividing said information in said second language into pages using automatic page 
division (see col. 3 lines 50 - 62 Britton discloses web pages having a mixture of 
formats to be converted to a single format). 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 4, 7, 27, 30, 38, 40, 47, 48, 51 , 52, 64 and 65 are rejected under 35 

U.S.C. 103(a) as being unpatentable over Britton in view of Vega - Garcia et al. 

U.S.Patent No. 6,839,734 (referred to hereafter as Vega-Garcia). 

As to claims 4, 7, 27, 30, 38, 40, 47, 48, 51 and 52, Britton teaches systems , 

methods and computer program products provided utilizing XML-based tools to tailor 
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HTML-based Web page content for display within various client devices. Moreover 
Britton teaches a method of transcoding information in a first markup language into a 
second markup language, the method comprising the steps of: 

responding to a request to view a Web page by retrieving information from said 
Web page, wherein said information is in a first markup language (see col. 3 lines 28 - 
39, Britton discloses requesting an HTML- based web page ) ; 

normalizing said information(see col. 3 lines 28-43 Britton discloses 
transferring the HTML - based web page format to XML format ); 

determining a second markup language that can be used by a browser using 
device detection, wherein said browser is used by a computer that is to view said 
information ( see col. 3 lines 40 - 48,Britton discloses the modified portions of XML 
document are converted back to HTML format) ; and 

transcoding said information into said second markup language( see col. 3 lines 
40 - 48, Britton discloses the conversion of the document to HTML and sent the web 
page with modified content to a client device for display). 

Britton does not teach streaming information in real time. However, Vega-Garcia 
teaches multimedia communications software with network streaming and multi-format 
conference (see col. 4 lines 20 - 27). It would have been obvious to one of the ordinary 
skill in the art to include streaming information in real time as in Britton's because doing 
so would enable a user to avoid delay entailed in downloading an entire file and then 
playing it . 
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As to claims 64 and 65, Britton teaches the invention as mentioned above. 
Britton does not teach streaming information in real time. However, Vega-Garcia 
teaches multimedia communications software with network streaming and multi-format 
conference (see col. 4 lines 20 - 27). It would have been obvious to one of the ordinary 
skill in the art to include streaming information in real time as in Britton's because doing 
so would enable a user to avoid delay entailed in downloading an entire file and then 
playing it . 

4. Claims 9 and 11-13 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Britton. 

Britton teaches the intermediate markup language is XML and the second format is 
Html. Britton does not explicitly teach the intermediate markup language is XHTML, and 
the second markup language is WML, cHTML or HDML. Official notice is taken that it 
would have been obvious for one of the ordinary skill in the art at the time of the 
invention was made to modify Britton by using the markup languages because doing so 
allow the system to use different formats of markup languages that are extremely 
simple" dialect of language suitable for use on the World-Wide-Web and therefore does 
not require use of specialized software on the client device. 

Conclusion 

The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

- System and Method for Cooperative Client/Server Customization of Web 
pages by Fields et al. U.S. Patent No. 6,412,008. 
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- System In which a Proxy- Server Translates Information Received From the 
Internet Into A Form/Format readily Usable By Low Power Portable 
computers by Kiknis U.S. Patent No. 5,727,159. 

- System For Dynamically Transcoding Data Transmitted Between Computers 
by Tso et al U.S.Patent No. 6,421,733 

- An Active Transcoding Proxy to Support Mobile Web Access by Harini 
Bharadvaj ( University of Missouri- Columbia). 

Transcoding Internet Content For Heterogeneous Client Devices. By John R. 
Smith, Rakesh Mohan and Chung- Sheng Li, pages III-599 - III- 602.Any inquiry 
concerning this communication or earlier communications from the examiner should be 
directed to Sargon N Nano whose telephone number is (571 ) 272-4007. The examiner 
can normally be reached on 8 hour. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Ario Etienne can be reached on (571) 272-4001. The fax phone number for 
the organization where this application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
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ABSTRACT 

There is a growing diversity of client devices that have 
access to the Internet. However, much of the content on 
the Internet cannot be handled by the devices that have 
limited communication, processing, storage and display ca- 
pabilities. In order to improve the utility of a wide range 
of client devices, we propose a network-based solution for 
transcoding Internet content. The system uses an InfoPy- 
ramid for representing and transcoding video, images, au- 
dio and text. The InfoPyramid manipulates the content 
along the dimensions of fidelity and modality, and aggre- 
gates the methods for content analysis, translation, filter- 
ing and selection. The InfoPyramid utilizes a policy engine, 
which incorporates user and publisher preferences, various 
transcoding policies, device descriptions, and real-time net- 
work constraints in order to adapt the Internet content to 
the client devices. 



1.1. Related work 

Recently, several systems have been developed for adapting 
Internet content to client devices. Fox, et al., developed 
a system for distilling, or compressing images that pass 
through an Internet proxy [3]. Other commercial systems 
such as Intel's Quick Web [4] and Spyglass' Prism [5] sim- 
ilarly compress the images that pass through the Internet 
service provider to speed-up download time. 

We have developed a content-based system for transcod- 
ing images [6]. This system retrieves and analyses images in 
the Internet and classifies them into image type and purpose 
classes in order to select appropriate methods for transcod- 
ing them. Since the system is limited to transcoding images, 
we propose a more powerful solution for Internet content 
that includes video, images, text and audio. The system 
transcodes the content along the dimensions of fidelity and 
modality in order to better adapt it to a large variety of 
client devices. 



1. INTRODUCTION 



1.2. Outline 



Many new devices, such as personal digital assistants (PDAs), 
hand- held computers, Internet- ready phones, and WebTVs, 
are gaining access to the Internet. The capabilities of these 
devices to receive, process, store and display Internet con- 
tent varies widely. Given the variety of client devices, it is 
difficult for Internet content publishers to tailor content' lb™ " 
the individual devices. 

Internet content publishers do not have many options 
for customising the content. In some cases, the publish- 
ers manually generate secondary, text-only versions of Web 
pages that the users can select instead of the originals. 
Other mechanisms within the HTTP protocol allow the 
client to specify some client attributes, such as the preferred 
language of the user, or the image, video, and audio formats 
supported by the client device. Using this information, the 
content server can automatically select and deliver content 
that is compatible with the client device and the user. 

In the case of client devices with minimal capabilities, 
such as pagers and cell phones, special markup languages 
are being developed, such as HDML ([1]). However, mech- 
anisms still need to be developed to automatically convert 
Internet content into these formats. The emerging XML 
markup language may improve the capabilities for adapt- 
ing content since separate style sheets can be developed 
for client devices that determine how the content is dis- 
played [2]. 



In this, paper, we propose a system for transcoding Internet 
content. In Section 2, we present an overview of the Internet 
content transcoding system. In Section 3, we present the 
InfoPyramid framework for representing and manipulating 
video, images,, audio and text. In Section 4, we describe 
-processes for selecting , the. : vcrsions of the content to adapt, 
thercontent .toithe clients according to capabilities and pref- 
erences. Finally, in Section 5, we describe the deployment 
of the transcoding system in the Internet as transcoding 
proxies. 

2. INTERNET CONTENT TRANSCODER 

Figure 1 illustrates the proposed Internet content transcod- 
ing system. The system retrieves and analyzes the Internet 
content and ingests it into the InfoPyramid format. A pol- 
icy engine gathers the capabilities of the client, the network 
conditions and the transcoding preferences of the user and 
publisher. This information is used to define the transcod- 
ing options for the client. The system then selects the out- 
put versions of the content and uses a library of content 
analysis, filtering, translation and manipulation routines to 
generate the content to be delivered to the client device. 

The Internet content transcoding system may be de- 
ployed at the server, proxy or client. Deployed at a proxy, 
the system is able to retrieve the Internet content, analyze 
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Figure 1: The Internet content transcoding system adapts the content to the capabilities of the client devices. 



and transcodc it, and deliver the results to the client, on- 
the-fly. Deployed at the server, the system can be used 
in the content publication process. The system can pre- 
materialize the alternate versions of the Internet content 
and store them at the server. In this case, the system merely 
selects the versions of the content to deliver to the client. 
In some cases, the transcoding system can be deployed at 
the client to customize the content display, such as accord- 
ing the user preferences, as long as the client has sufficient 
capabilities. 

3. INFOPYRAMID FRAMEWORK 

The InfoPyramid provides a general framework for han- 
dling the Internet content. It allows specialized methods to 
be plugged-in for analyzing, filtering, translating and ma- 
nipulating the Internet content. As depicted in Figure 2, 
the InfoPyramid develops a conceptually redundant repre- 
sentation of the Internet content that aggregates multiple 
versions of the content along the dimensions of modality 
(video, image, text, and audio) and fidelity (which includes 
summarized and compressed versions) [7]. The translation 
and summarization methods generate the alternate versions 
of the content as needed. 

3.1. Translation and summarization 

The translation methods convert the content between modal- 
ities, such as, text to audio, or video to images. On the 
other hand, the summarization methods generate versions 
within the same modality, but with different fidelity. For 
example, the summarization methods compress the images, 
summarize text, and extract and re-animate the key-frames 
from video. The translation and summarization methods 
can be cascaded to change both the modality and fidelity 
of the content. In this case, a video can be converted to a 
small, reduced-color image. 

For each of the modalities, we describe some of the sum- 
marization and translation methods that could be used to 
change the fidelity and modality of the content, respectively. 




1. Images: 



Fidelity - Spatial size reduction, color depth re- 
duction, lossy compression [6]. 



MODALITY 

Figure 2: The InfoPyramid aggregates multiple represen- 
tations of the content along the dimensions of fidelity and 
modality and unifies the methods for manipulating the con- 
tent. 



• Modality - Images to text: related text, embed- 
ded text, semantic labels [8]. 

" 2V. Video: \ \ \ wzj rS-^. : 

. — • Fidelity - Spatial size, temporal size, playback 
rate, bit-rate. 

• Modality - Video to images: key-frames; Video 
to audio: sound track; Video to text: closed 
captions, embedded text. 

3. Text: 

• Fidelity - Key-term extraction, text summa- 
risation, document headings extraction. 

• Modality - Text to audio: speech synthesis, 
Text to text: language translation (i.e., English 
to French). 

4. Audio: 

• Fidelity - Bit-rate reduction, stereo to mono. 

• Modality - Audio to text: speech recognition. 

In order to evaluate content alternatives, the system 
could assign content value scores to each of the content 
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alternatives. Using the content value scores, the system is 
able to optimise the selection of the content according to 
the device capabilities, preferences and network conditions. 



4. TRANSCODING SYSTEM 

We propose that the transcoding system utilizes a policy 
engine to evaluate the alternatives for adapting the content 
to the client devices. 



4.1. Policy Engine 

The policy engine would gather the capabilities of the client 
and the transcoding preferences of the user and publisher, 
and sense the network conditions to define the transcoding 
options for the client. In order adapt the Internet content to 
these devices, the transcoding proxy generates and selects 
versions of the content according to the policies, network 
and device constraints, and preferences. 



4.1.1. Content value scores 

The InfoPyramid system provides the mechanism for as- 
signing content value scores to the alternate versions of the 
content. In some cases, the content value scores are derived 
automatically by measuring the loss in fidelity that results 
from translating or summarizing the content. For example, 
the content value scores can be linked to the distortion in- 
troduced from compressing the images or audio. Otherwise, 
the content value scores can be tied directly to the methods 
that manipulate the content. 



FIDELITY 




MODAUTY 

Figure 3: Example Internet content value scores assigned 
for video. 

Figure 3 illustrates examples of content value scores that 
can be assigned for transcoding video. In this example, 
the original video has the highest content value. Each ma- 
nipulation of the video along the dimension of fidelity or 
modality alters the content value. For example, converting 
the video to a sequence of images results in a small reduc- 
tion in content value. Converting the video to a highly 
compressed audio track produces a higher reduction in the 
content value. 



4.1.2. User and publisher preferences 

The content value scores comprise only part of the infor- 
mation that can be used in the content selection process. 
Both the publisher and user may have preferences for how 
the content is transcoded. Figure 4 illustrates some exam- 
ple preferences for transcoding images. In this example, the 
preferred versions for the image content are the low-fidelity 
versions of the image, and the translations to text. 
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Figure 4: Internet content format preferences may be es- 
tablished by the publisher or user. This example illustrates 
the preferred content alternatives for images. Device con- 
straints may further score or eliminate some content alter- 
natives. 



4.1.3. Device constraints 

Various constraints of the devices affect the selection of the 
content. For example, many devices cannot handle video. 
In this case, the corresponding content alternatives can be 
eliminated (see Figure 4). Overall, the display, storage and 
processing capabilities of the client devices eliminate the 
selection of individual versions of the content. These also 
. place constraints on the set of selections for a L .Web docu- 
ment For e^&m^ r X-ilieX^y?9^ hs*Va local storage of only 
16 KB, then this places a hard limit oh the total size of the 
versions of the content selected. 

4.1.4. Network constraints 

Similarly, the network would place constraints on the con- 
tent selection process. In general, the network constraints 
introduce a trade-off between content data size and load 
time. For example, if the user specifies a maximum load 
time for a page, then to accommodate this load time, the 
transcoder system must sense the end-to-end bandwidth to 
derive the target data size for the content. Then, the sys- 
tem can select the content of maximum content value that 
is within the target data size. 

4.2. Content Selection 

We investigate more closely how the system could optimize 
the selection of the content alternatives. Consider the Web 
document with two objects, A and B, depicted in Figure 5. 
Let Aij = Tij(A) and B ki = T h i(B) transcode A and B t 
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original content transcodcd content 



Figure 5: Example of content selection for a Web document 
consisting of two objects, A and B. 

respectively, to target modalities % and k, and fidelities j 
and I. Let V(Aij) and V(Bki) be the content value scores 
for Aij and Bm, and let S(A^) and S(B k i) be the data sises 
oiAij and Bki. Finally, let St be the maximum data size for 
the content, which may have been derived from the user's 
specified maximum load-time and the network conditions. 

4.2.1. Maximum content value 

The content selection process selects A-j and with max- 
imum content value for a target data size as follows: 

K(^)-fV(B; ( ) = .mKtW + W), and 
S(j4?;) + S(B;«) < St. (1) 

4.2.2. Minimum load time 

Given a minimum acceptable content value Vr, the content 
alternatives A*j and BJ t of minimum data size are given by: 

S(^) + £(#,) = mui(S(A-i) + S(iM) ( and 

+ > Vr. (2) 

4.2.3. Device constraints and preferences 

By extending this optimization process, the content selec- 
tion system could incorporate the user (U(A^)) and pub- 
lisher (P(A{j)) preferences, and client device constraints 
(D(i4jy)), to best adapt the content. In this case, the total 
score of each item k in the Web document is given by 

Z{A h {j ) = u v V* + »*U % ) + v P Pli + u>*D}j, 

where w p , and u<t are weighting factors assigned by 

the user or system. 

5. REAL-TIME, NETWORK- BASED TRANSCODING 

The transcoding system can be implemented in the net- 
work in the form of a transcoding proxy (TP) to perform 
transcoding on-t he-fly, sec Figure 6. The transcoding proxy 
system is designed to have a high bandwidth connection to 
the content provider. In most cases, the proxy has a low 
bandwidth connection to the client. As a result, reduc- 
ing the amount data through compression and transcoding 
at the proxy results in faster delivery, even when account- 
ing for the added time for content analysis, selection and 
transcoding [0], 




Figure 6: Network-based transcoding of Internet content 
using transcoding proxies (TP). 



6. SUMMARY 

We proposed a system for the network-based transcoding 
of Internet content in order to improve the accessibility of a 
wide range of client devices to the content on the Internet. 
The transcoding system retrieves, analyzes, and ingests the 
Internet content into an InfoPyramid representation. The 
InfoPyiamid provides a conceptual framework for manip- 
ulating the content along the dimensions of modality and 
fidelity. The transcoding system selects the content from 
the InfoPyramid by assessing the various content alterna- 
tives to adapt the Internet content to the client devices. In 
this way, a wide range of client devices gain access to the 
large amounts of content on the Internet that is otherwise 
beyond their capabilities. 
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Abstract 

In this paper, we present a proxy based system 
(MOWSER) to support web browsing from mobile clients 
over wireless networks. Mowscr is a proxy agent between 
the. mobile host and the web server, which performs active 
transcoding of data on both upstream and downstream traf- 
fic to present web information to the mobile user according 
t;n the QaS parameters set fty the user. Active, transcoding is 
defined as modifying the HTTP stream in situ, and it is en- 
tirely transparent to the user. Further, our system does not 
pose any additional requirements on the mobile user. This 
is an improvement over other proxy based systems, which 
only transcodc images on the downstream and are mostly 
not configurable. White developed for mobile users, such a 
system can actually be useful in any low bandxvidth scenario. 



1. Introduction 

With an ever increasing amount of information "out 
there" on the World Wide Web, and mobility becoming 
the need of the hour, users want to have information at 
their fingertips wherever they may be. Recent develop- 
ments in mobile computing and web technologies have re- 
sulted in an increase in the number of "road warriors*, peo- 
ple who use mobile computers to access their business in- 
formation repositories, often through the web. These users 
need speedy access to that data. One cannot expect them 
to wait for hours to download a web page containing a lot 
of multimedia data over a wireless network, most of which 
cannot be handled by the small and ,: easy- to-carry w lap- 
top/palmtop computer anyway. Most mobile computers 
have very limited cpu. memory & disk resources. They 
communicate over wireless links which are characterized by 
lower band widths, higher error rates, and more frequent dis- 
connections. Moreover, in a mobile environment, changes 
to the network bandwidth and resources are very common. 
These arc only some, of the challenges of mobile computing, 
move, of which arc discussed in In the contexr of web 
browsing, web servers do nor consider the network connec- 
tion between I hem and the client. Tliey also have no con- 
cern for t he hardware r.apnhilil ies ol" t in- rlieiil . They just. 

I his work -.v:i-4 support it) part: by :m IBM I'aenUy Detvt-I 
opulent Award In A. Ji»shi 



return the document asked For assuming that the client is 
capable of properly receiving and displaying the data. Peo- 
ple are commonly creating web pages rich in multimedia 
data, pages with several images and videos have become 
very common. Presenting all this multimedia-rich data over 
wireless networks to the mobile user is a major challenge. 
The mobile computer either does not have the. appropriate 
hardware or adequate bandwidth (or both) to handle the 
content, and the web servers also cannot be modified to 
suit the mobile user, therefore, the only, way of bridging 
the gap between the highly resource-rich web servey at one 
end and the highly resource-poor mobile client at "the other, 
is by introducing a system in between which modifies the 
contents on the web and present it in the most appropriate 
form to the mobile user. Hence, tlie need for middleware, 
which adapts to the mobile environment, gets tlie be3t of 
what is available on I he web and does not pose any addi- 
tional requirements on the mobile client. 

We have devised such a middleware based solution which 
allows the user of a mobile computer to control the way 
in which the data from the web is retrieved, dynamically 
transform the data in a way that is transparent to the user, 
without requiring the mobile computer to do any additional 
work. We achieve this by introducing a proxy agent be- 
tween the mobile client and the web server. Our proxy, 
Mowser, lets the user specify and contror fehe viewing pref- 
erences and hardware capabilities of the client, transforms 
the data to and from the web server, without requiring any 
change on the client. We talk about related work in section 
2 and describe the software architecture of our system in 
section 3. In section 4. we describe the various methods of 
transforming data that we have used. The results of our 
experiments are presented in section 5 and we discuss it in 
section 6. 

2. Related Work 

The Client- Proxy- Server model has begun to feature in 
many mobile applications to overcome the challenges faced 
in the mobile computing scenario. However, only some of 
t hem actually transform the stream between the client and 
the server. 

In the OloMop model described in [5].[l l] the proxy 
performs "distillation" of the document received from the 
server before sending it to the clienl . Dist illation is defined 
here as highly Io*m\ real-time, datatype-specific compres- 
sion thai preserves most of (he semantic content- of a docu- 
ment.. The approach of transcoding image tiles is similar to 
our approach first outlined in [y], but video files are handled 



differently. They perform real-time transcoding of motion 
JPEG to sub-sampled H.261, while our approach, described 
later in this paper, represents a video stream using repre- 
sentative frames. GloMop also allows refinement of selected 
portions of the document. 

The Mowgli model [14] consists of two mediators, the 
Mowgli Agent and the Mowgli Proxy located on the mo- 
bile host and the mobile-connection host respectively, They 
use the Mowgli HTTP protocol to communicate with each 
other, which reduces the number, of round-trips between 
client and server. A : specialized transport service, the 
Mowgli Data Channel Service is used for reliable com muni- 
cation between the mobile-connection host and the mobile 
host. Mowgli WWW reduces the data transfer over the 
wireless link in three ways; data compression, caching, and 
intelligent filtering. It only performs GIF to JPKG conver- 
sion, and large embedded images are not transferred at all 
to the. mobile node. 

2enel proposes an architecture in [19] in which the proxy 
is made up of three components: High Level Proxy, Low 
Level Proxy and Event Manager. The High Level Proxy al- 
lows filters for application layer protocols to be downloaded 
dynamically from mobile host applications. The Low Level 
Proxy is used to create and install filters for the transport 
and network layers. A control interface is provided for the 
filters running within these proxies by the Event Manager. 

HTTP transducers called OreOs (defined in [3]) are spe- 
cialized processing modules having lour classes of function- 
ality: filtering HTTP requests and responses, characterizing 
sets of messages, transforming message contents and addi- 
tional processing indicated by the messages. The "stream 
model" approach of signal processing is used to perform 
complicated processing. 

In the work of Sat.hyanarayanan et. al, a module called 
Cellophane [17] on the client transforms HTTP requests 
from Netscape into file operations on Odyssey [16] web ob- 
jects and makes use of Odyssey. API to select fidelity levels 
for images which are forwarded to a distillation server. The 
distillation server distills the images to the requested fidelity 
before passing them back to the client. However, this ap- 
proach is specific to the Odyssey file system and- requires a 
modified version of the Net BSD kernel. This also requires 
the addition of a module on the client. 

Intel's Quick Web Technology [7] which sits on ISP 
servers compresses images by selectively dropping bits or 
pixels out of an image using lossy compression techniques, 
thereby speeding up the download of graphically- rich web 
pages. It also caches data to overcome the problem of band- 
width bulge. This can be used only when the access to 
the Internet is through ISP. and the user, requires a Java- 
enabled browser to have control over the compression. 

IBM's Web Express [6] consists of two components: AR- 
Tour (Advanced Radio (communications on Tour) Gateway 
and ARTour Client. The Gateway provides secure, com- 
pressed data across the selected network with authentica- 
tion. It can automatically retrieve Web requests in the 
background while mobile users are performing other tasks. 

In our system, the proxy performs active transcoding 
of HTTP requests horn the client, while sending it to the 
server, according to the preferences sel by the mobile host, 
sn that the dominant in the moM. suitable format is re- 
trieved. It ;i!mi processus the received HTTP data beture 
sending it lo the mobile host if necessary. 



3, Software i&reftitecture 

Mowscr is a proxy HTTP server agent (written in Perl] 
[18] which allows a mobile user to specify his or her view- 
ing preferences, based on the network connection and avail- 
able resources, and performs active transcoding of HTTP 
streams accordingly. The* software architecture that we 
propose introduces a proxy server on each Mobile Support 
Station (MSS) to the basic MSS-MH {Mobile Host) model 
which accepts and stores the preferences for each of its mo- 
bile hosts, acts as trie server to the mobile host, and as a 
client to the WWW server. No modifications to the web 
client on the MH is required. So any WWW browser that ;. 
can handle forms and has the provision of a proxy can be 
used. No additional software is required on the mobile host. 
Also, setting and updating preferences is done by just fill- 
ing up a (3G1 form on a URL at the web site maintained 
by. the proxy, server. 

In the initial versions of Mowser [9], [10], the proxy dealt 
with getting the viewing preferences for a MH from the user 
and storing it according to its IP address. The current ver- 
sion also stores the accept headers that will be most suitable 
for the MH based on the preferences set by the user. The 
viewing preferences stored for each MH include a starting 
point, color capability, video resolution, sound capability, 
maximum allowed size for text, image, video, audio files 
and files of unknown type, and the size reduction technique 
for image files. Not all of these variables are presently used 
by the proxy. The preferences can be updated by the user of 
the MH whenever there is a change in the network connec- 
tion or available resources. We are using an Apache HTTP 
server to store the preferences, aud GG1 scripts written in 
Tel and Perl are used to update and save the preferences. 

Once the MH sets the Proxy server as its proxy, all com- 
munication beiiween the MB and the W'W-W servers is di- 
rected through this proxy. When the proxy receives a re- 
quest from the MH. it looks up the preferences stored with 
the IP address of the MH and processes the request accord- 
ingly. Default preferences are used if no preferences had 
been specified. The proxy processes requests to set pref- 
erences and the GET requests. AW other requests are for- 
warded to the target WWW server, hi the next section, we 
detail how the proxy performs active transcoding of HTTP 
streams.' 

4. Active 3kaiiseoding 

Modifying the HTTP stream and changing its content in 
situ is called active transcoding. This is done dynamically 
without any user Intervention. For example, if an image file 
does not meet the size or color specifications, it is reduced 
before being sent to the MH (described in section '1.2), Sim- 
ilarly, sound files will not be sent to a MH with no sound 
capabilities, and so on. 

Traditionally, transcoding is a unidirectional process [9], 
[5], [6], [7]. In other words, the request from the client is 
passed as is to the target server, while the return stream's 
multimedia content is altered. In our work, we alter the 
request as well, so as to take advantage of some net- friendly 
features of JITTP/1.1. 

After setting preferences and making our MOWSER as 
its proxy, the user can browse the web as s/he would with 
any web client. On receiving a request from the Mil. the 
proxy fetches the preferences set by the MH and serves I he 
Mil with filets in the most suitable format. Default pref- 
erences are used if no values had been set by the Ml!. We 



process GtSI requests received From the MM be- 

fore sending it to the WWW server, and modify image and 
video files received from the WWW server before transmit- 
ting it to the M : H if necessary. <Fo support MHs with very 
limited resources and hardware capabilities like PDAs, we 
even parse the H3?Mli stream Co remove the active content 
and any tags that the MH cannot handle. The user may 
even choose to block any HTM Li file greater than a given 
size. With transcoding being done at two steps indepen- 
dently as shown in Figure J. we are making sure that, we 
match the preferences of the MH. while using the wired 
bandwidth in the most efficient manner. 

4.1. Transcoding of HTTP Requests 

HTTP/ 1.1 introduces the concept of content negotia- 
tion. The basic idea is that a WWW server may have sev- 
eral different representations of a resource. For example, it 
may store a document as postscript or word or HTML, etc. 
>Fhe server can automatically, choose the file to send if the 
client sends the preferred representations as part of each 
request. Most servers (e.g., apache), even though not fully 
1.1 compliant, already support content negotiation and will 
store files in several formats and in several variations of a 
format. We use this idea to get the file in the most ap- 
propriate format for the MH. For example, an image file 
may he in ade. available in varying resolutions by the con- 
tent provider on the server. We request the server to send 
the image file which has the resolution appropriate to the 
present QoS and client parameters, by including the prefer- 
ence in the request. This requires that the variants of a file 
have different mime types. For example, in our experiments 
we have used image/ x-sgif to denote an image file with very 
low resolution, image/x-mgif to denote one with medium 
resolution and image/x-lgif to denote one with large resolu- 
tion. Wc have also introduced vidco/x-rmpg to denote rep- 
resentative frames of video files (discussed in section 4.3). 

Any HTTP GET request received from the MH is 
munged to an HTTP 1.1 request and the complete URI 
is included in the request line. The Accept headers stored 
for the MH are then appended to the outgoing stream to 
request for the file in the format most suited for the MH. 
A Host header is added to complete the HTTP 1.1 request. 
The server performs content negotiation and sends tile file 
which closely meets the format specified in the request. ' 
Thus, the process is transparent to the user, and works even 
if the request comes from an HTTP^LO compliant browser, 
like most present commercial systems. 

For example, for a MH host on a low bandwidth line, 
the proxy may append the following Accept headers to tKc 
request after making it a HTTP 1.1 request: 

Accept: image/x-sgif. video/x-rmpg 

For a MH like the PalmPilot, which can handle only text 
and images, the proxy greatly reduces the data transfer by 
selectively GETing the files. That is, when the proxy re- 
ceives a GET request from a PDA : , it sends a HEAD request 
to the WWW server to get information about the content 
type of the file, and then GETs the file only, if the PDA 
can handle it. For example, the proxy does not request for 
audio, video and application files for a PDA. Since a page 
has the URLs of additional files to be fetched embedded in 
it. we could prevent (lie client on (.he MH from generating 
t he additional GET requests and design the proxy to decide 
whether to GET the file or not hy just looking at the exten- 
sion iif (Jin file name in the embedded URLs. However, the 
content-type of the file, is a bel ter indicator of t he format- of 



the file than the extension in the filename, though getting 
this information requires an additional HEAD request to 
he sent. 

4.2. Transcoding Image Piles 

When the proxy finds an image tag in the HTTP st ream 
received from the server, it reads the URI of the image 
file to be fetched and first sends a HEAD request to the 
server. It checks the content-type and content-length in- 
formation received from the server. If the content-length 
is small enough to be handled on the MH, the image file 
is sent to the MH unmodified. But if the image is larger 
tlian wliat can be handled by the MH, it is reduced in size 
or color as requested by the MH. The image files are scaled 
down in size, or the number of colors is reduced, or both 
without sacrificing semantics. On an image map for in- 
stance, size is not changed, only colors are. to preserve the 
semantics. The content-type information is used to decide 
the transformations that the image file has to go through. 
We convert all images to fee reduced to portable pixmap 
format for processing and then convert them back to gif 
format for displaying. Then the original U.REi in the image 
tag is replaced with the URL of the modified image stored 
locally by the proxy and sent to the MH. This makes the 
MH GET the modified image file from the proxy. To display 
images on PDAs, the proxy might have to reduce images to" 
2-bit gray scale and thumb size. 

4.3. Transcoding \£ideo Data 

Unlike image data where transcoding steps are obvi- 
ous, video data represents a great challenge. Simple sub- 
sampling, as proposed in [5]. is still not adequate as some 
clients may not have enough computational resources to do 
software decoding of MPEG or H.261. We use the struc- 
ture inherent in video streams to do the transcoding. The 
structure of video is a hierarchy of the movie or episode. 
This hierarchy is segments, scenes, and shots. Each seg- 
ment consists of sequence of scenes, each scene consists of 
several shots, and each shot is composed of several frames 
which Have similar visual properties: Inns one- of these 
frames can be selected as a Representative frame (Rframe) 
for the shot. 

We present the video to the user by the representative 
frames which are picked from each shot. Using techniques, 
we have developed [ij : [8j to support content based access 
to "networked video databases. Wc Have used several fuzzy 
clustering algorithms, sucft as fuzzy c-mean, hard c-mean, 
fuzzy c- median, hard c- median and possibilistic c-mean 
[2], [13], [12]. We use luminance and chrominance features, 
and 1-norm and 2-norm distance measures [If, [8] in order 
to group the frames which have similar properties together. 
Each group is classified as one shot. We pick the frame 
that is closest to center of each group to be Rframe. Wc do 
not explicitly use any scene change detection algorithms. 
T:he fuzzy techniques are used since frames can belong to 
the clusters to different degrees (membership values). Tra- 
ditional scene change algorithms, which insist on a frame 
belonging to only one group, break down when confronted 
with gradual scene changes typically found in videos. While 
Rf rames can be computed dynamically by the proxy, we feel 
that from a computational perspect ive, this should be done 
at the server side. In fact, it can be argued that 
these will typically be available at th« server side already 
to support querying and browsing of the video database. 



4.4. Transcoding Active Content 

Fur moliile hosts with limited memory and computa- 
tional resources, the user can decide not to receive any Java 
applets, JavaScripts, VBSeripts. etc. When such a prefer- 
ence is set. the document to be transmitted is parsed and 
all the active content is eliminated before sending it to the 
MH. This is specifically, suitable for the PDiAis which have 
very small disk space and low speed CPUs. Also, given re- 
strictions on the memory footprints of tlie applications that 
can reside on such machines, it is not clear that browsers 
will be able to support the virtual machines needed for ac- 
tive content languages such as Java. 

Often though, .lava (and JavaScript) are used to provide 
functionality, that can be duplicated using GG1 callbacks or 
server parsed HTML. An interesting option that wc wish 
to pursue here is to see if we could have the forms capable 
client request for a GGI version instead of the JavaScript 
from the server. In other words, replace active content of 
a page with equivalent dynamic content. This feature will 
be supported via content negotiation. Like all content ne- 
gotiation, it assumes that the server provides alternative 
versions (GGI based vs Java based) of a particular URL. 

4.5. transcoding HTMIi 

We can reduce the computation on the Mil by parsing 
H-FML tags on the proxy itself, rather that on the MH. We 
can eliminate all the tags that the MH does not support, 
and references to any file that the MH is not capable of 
handling. For example, we can eliminate I he italics tag. 
cascade style sheets, etc. for a PDA such as the Palm Pilot. 
For such severely resource constrained MHs, the set of tags 
that it can handle may be so small, that it is advantageous 
to strip of all unwanted tags at the proxy, and encode, the 
remaining tags using a few bits. 

By choosing the specific options, the user can use any or 
all of these transcoding methods depending on the limita- 
tions of the client or network connection, and can change it 
when resources change. This makes our proxy very adapt- 
able to serve the varying needs of the user. For example, a 
user on a laptop may. want, to only limit the size of video and 
audio files when s^he is connected via a slow telephone . mo- 
dem, and remove this restriction when connected through 
the e theme t. ft user on a on the other hand, will 

want to filter out everything except text and small images. 

5. Experimental results 

Our proxy server is a modified version of a HTTP server 
written in Perl. We are using an Apache server to store 
the preferences and to act as the WWW server capable of 
content-negotiation. We stored multiple formats of some 
files and requested them with different preference settings. 

i/\n example of transcoding due to content negotiation: 

We set the following preferences for one computer (iA; 
desktop) Maximum Image file size = 20K Maximum Video 
file size = 500K 

The accept headers added to its request were: Accept: 
image/x-lgif, image/gif:q=0.6, video/mpeg 

We requested for the page http:// bochi. cecs. missouri. 
edu: 902 L/ demo.html 

Figure 2 shows the response received. 

We set the following preferences for another computer (A 
laptop) Maximum linage file size = bl< Maximum Video file 
siz« = 25K 



The accept headers added to its request were: 
Accept: image^x-sgif, image/5gif;q=0.6. videofc-rmpg, 
video f. mp eg; q=0 . 6 

We requested for the same page fittp:/?/ bochi. cecs. 
missouri. edu: 9021/ demo.html 

Figure 3 shows the response received. 

An example of transcoding of images received from the 
server: 

For tlie same two computers, (same preferences set as 
above) we requested the page 

http :p www. missouri. edu /"csacm We image file existed 
only in gif format on the server. The image in the doc- 
ument is small enough for the desktop and hence passer] 
through without any reduction as seen in Figure 4. But 
the image is large for the laptop (larger than 6K). There- 
fore, lihe proxy reduced the resolution of tlie image as seen 
in Figure 5. 

To see some video files and their representative frames, 
please visit 
http:^meru.cecs.missoin*i.edu/"sansanee^mpeg 

We kept different settings on two computers and ac- 
cessed the same web page to see the difference. For the 
first computer we set a large value for the maximum size 
of image and video files allowed, and sent accept headers 
to allow large gif files and mpeg movie files. Therefore, wc 
received large image files and the entire movie file. For tlie 
second computer, we limited the size of image and video 
files, and sent accept headers requesting for small images 
and. representative frames of mpeg files. Hence, we received 
smaller versions of the images and only the representative 
frames for the movie file. Our request did not specify the 
extension of the file name, and the file was available in mul- 
tiple formats. If the size of the received image files is larger 
than the maximum size specified, the proxy scales it down 
either by size or by color as set by the user. 

Glearly, the results are Best understood by experienc- 
ing the proxy based model. We have made the proxy 
available on the web. it may be accessed at the l>'RIi 
http: //nirvana. cecs. missouri. edu :8G01. ft major drawback 
of the present implementation is its overhead. Since this 
is a demonstation prototype, it has been mostly in PERIi, 
and is thus may slow to execute. 

6. Discussion- . r- - 

In this paper, .we have presented a proxy based system 
(MOW'S BR) to support; web browsing from mobile plat- 
forms. It follows the client-proxy-server model which is 
the basis of most mobile applications and uses the proxy 
to provide active transcoding. Proxies are mostly used for 
forwarding data between the mobile client and the station- 
ary server, The idea of using transcoding at the proxy to 
support mobility is not new per se. Many proxy based sys- 
tems f5],[?3y*9] have been developed to provide web access 
to mobile users. However, they typically Cranscode the im- 
age data received from the WWW server before sending 
it to the mobile client, and are often not configurable. In 
Mowser, we extend the notion of transcoding to both the 
upstream and the downstream traffic. More specifically, the 
upstream request is raunged into a HTTP/1.1 request, to 
make use of the content negotiation feature, and Accept 
headers are appended, to request for the document in the 
format most appropriate for the ^oS parameters set by the 
mobile user. On the downstream, in addition to transcoding 
of images, we also provide the options of removing the active 
content and transcoding HTML. The mobile user can select 
any or all of these options. This is to support the changing 



requirements of a wide variety of mobile hosts ranging from 
a powerful notebook with 233MHz GVM : 2<3B RAM, which 
may require only, image and video data transcoding, to a 
PDA with 64KB dynamic RAM. which requires all possible 
transcoding and filtering of data. 

lu ongoing work, we ate expending the proxy to effec- 
tively use all the preferences set by Che user to limit or 
transform the data before serving the MH. Further, our 
proxy adds a performance overhead due to two reasons. 
First, it is written in Perl and uses netpbm for the process- 
ing of image files. We r peed could be increased by writing 
optimized G code and image conversion routines. Second, 
messages go all the way up to the application layer in the 
proxy even if data just needs to be written from one socket 
to another. Research is going on in the IBM Watson iiabs 
to avoid this by using Splice [15] which allows data to 
flow through without going to the application layer in the 
proxy if necessary and such techniques could eventually be 
integrated into our system. ^Ehc overhead is justifiable be- 
cause we are trading proxy side GPXJ cycles for the more 
expensive client side GPU cycles and network bandwidth. 
In experimental situations, it has been observed that extra 
time taken by the proxy is still less than the time needed 
to send untransformed data on the wireless network. 
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Figure 1. Active Transcoding of HTTP Stream 
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